首页> 外文OA文献 >Computing on Masked Data: a High Performance Method for Improving Big Data Veracity
【2h】

Computing on Masked Data: a High Performance Method for Improving Big Data Veracity

机译:掩盖数据计算:一种改进大数据的高性能方法   数据准确性

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

The growing gap between data and users calls for innovative tools thataddress the challenges faced by big data volume, velocity and variety. Alongwith these standard three V's of big data, an emerging fourth "V" is veracity,which addresses the confidentiality, integrity, and availability of the data.Traditional cryptographic techniques that ensure the veracity of data can haveoverheads that are too large to apply to big data. This work introduces a newtechnique called Computing on Masked Data (CMD), which improves data veracityby allowing computations to be performed directly on masked data and ensuringthat only authorized recipients can unmask the data. Using the sparse linearalgebra of associative arrays, CMD can be performed with significantly lessoverhead than other approaches while still supporting a wide range of linearalgebraic operations on the masked data. Databases with strong support ofsparse operations, such as SciDB or Apache Accumulo, are ideally suited to thistechnique. Examples are shown for the application of CMD to a complex DNAmatching algorithm and to database operations over social media data.
机译:数据和用户之间日益扩大的差距要求使用创新的工具来应对大数据量,速度和多样性所面临的挑战。与大数据的这三个标准V一起,真实性出现了第四个“ V”,它解决了数据的机密性,完整性和可用性。确保数据真实性的传统加密技术可能会有太大的开销,无法应用于大数据数据。这项工作引入了一种新技术,称为“基于屏蔽数据的计算”(CMD),它允许直接在屏蔽数据上执行计算并确保只有授权的接收者才能对数据进行屏蔽,从而提高了数据准确性。使用关联数组的稀疏线性代数,可以以比其他方法少得多的开销来执行CMD,同时仍然支持对掩码数据进行广泛的线性代数运算。强烈支持稀疏操作的数据库(例如SciDB或Apache Accumulo)非常适合此技术。给出了将CMD应用到复杂的DNA匹配算法以及社交媒体数据上的数据库操作的示例。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号